This study aimed to formulate a suitable statistical model to determine the half-maximal inhibitory concentration (IC50) of a series of synthetic compounds against the acetylcholinesterase enzyme. The compounds share the same core structure and were synthesized as acetylcholinesterase inhibitors for anti-Alzheimer's disease (AD) activity. The IC50 values of eighteen synthesized compounds for anticholinesterase activity were obtained, and statistical methods were applied. Regression models were fitted to the dose-response curve to estimate the IC50. Simple linear regression is the simplest model for the dose-response curve. However, polynomial regression models or non-linear regression models can fit the data more accurately. The adjusted coefficient of determination (R²adj) was used to determine the best model among the linear models, while the root mean square error (RMSE) is more suitable for comparing the goodness of fit between linear and non-linear models. Four-parameter logistic regression (4-PLR) often fits dose-response data closely. Based on the RMSE value, a polynomial regression fitted better than 4-PLR, with an IC50 of 245.52 µg/mL.

1. INTRODUCTION

Alzheimer's disease (AD) is a form of dementia characterized by severe loss of memory, language, and other intellectual abilities. The exact cause is unknown, but it is very likely related to environmental and genetic factors. The disease is the sixth leading cause of death in the world [1], [2]. The majority of patients are 65 years of age or older. Symptoms develop slowly and worsen over time. Alzheimer's disease cannot be cured. Instead, treatment, which includes cholinesterase inhibitors [3], is used to slow the progression of its effects and to improve cognitive function.

Synthesizing a series of compounds with the same core skeleton is a common way to improve their biological activities. It is therefore worthwhile to search for compounds with good anti-acetylcholinesterase activity for Alzheimer's disease [2]. Multi-well plates are often used in biological assays in pharmacological research. The parameter of interest is usually the drug concentration corresponding to the median response. A measure of a drug's potency is the half-maximal inhibitory concentration (IC50), which is the amount of drug needed to inhibit a biological process by 50% [4]. The lower the IC50 value, the lower the drug concentration required [5]. A lower drug dose is preferred because a higher drug concentration may be harmful to health.

In fitting the dose-response curve, simple linear regression (SLR) is the most straightforward statistical tool for estimating the relationship between variables. It studies the functional dependency between factors; that is, the independent variable determines the value of the dependent variable. In some situations, a straight-line relationship does not hold. Polynomial regression is then introduced, in which the relationship between the independent variable x and the dependent variable y is modeled by an nth-degree polynomial. It is still a linear model because it is linear in the regression coefficients, even though the fitted curve is not a straight line in x.
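To make the point about linearity in the coefficients concrete, the following is a minimal sketch, not the procedure used in this study. It assumes NumPy is available; the concentration and inhibition values are invented for illustration, and `fit_polynomial` is a hypothetical helper name. The polynomial fit reduces to ordinary least squares on a design matrix whose columns are powers of x.

```python
import numpy as np

# Hypothetical concentrations (ug/mL) and percentage inhibition values,
# invented purely for illustration.
x = np.array([25.0, 50.0, 100.0, 200.0, 400.0, 800.0])
y = np.array([20.5, 24.9, 31.2, 45.0, 66.1, 88.0])

def fit_polynomial(x, y, degree):
    """Least-squares polynomial fit; returns coefficients b0, b1, ..., bn."""
    # Design matrix with columns 1, x, x^2, ..., x^degree.
    X = np.vander(x, degree + 1, increasing=True)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

linear = fit_polynomial(x, y, 1)      # b0, b1
quadratic = fit_polynomial(x, y, 2)   # b0, b1, b2
print("SLR coefficients:", linear)
print("QR coefficients:", quadratic)
```

Because the unknowns enter only as multipliers of known columns, the same least-squares machinery serves the straight line, the quadratic, and the cubic.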

Non-linear regression is another powerful approach that can fit virtually any curve. For example, logistic regression fits dose-response data and estimates the expected responses at different doses. A simple 2-parameter logistic regression model adequately summarizes how a binary response relates to dose. Its dose-response curve has a lower asymptote of zero and an upper asymptote of one, the limits of the expected range of response probabilities. However, when the response variable is continuous, three or more parameters are more appropriate [6]. To generalize the 2-parameter logistic regression model, two more parameters are added, so that the upper response asymptote may be less than one and the lower response asymptote may be more than zero. The resulting four-parameter logistic (4-PLR) model increases flexibility at a higher optimization cost. The 4-PLR model [5] is given in (1):

$$y = d + \frac{a - d}{1 + \left(\frac{x}{c}\right)^{b}} \qquad (1)$$

where a is the minimum value of the response variable, b is the Hill slope, c is the inflection point, d is the maximum value of the response variable, y is the dependent variable, and x is the concentration. The Hill slope describes how steeply the response changes with respect to the dose. The inflection point is the dose corresponding to the median response. The 4-PLR has several characteristics [7]. For instance, its sigmoidal curve (S-curve) is perfectly symmetric about its inflection point. It assumes that the response is symmetric about the median response and that it is monotonic, either constantly increasing or constantly decreasing with x. The objective of this research is to formulate a suitable statistical model to determine the IC50 of a series of synthetic compounds with the same core skeleton for Alzheimer's disease.
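The role of the four parameters can be checked numerically. The sketch below assumes the common parameterization y = d + (a − d)/(1 + (x/c)^b), which matches the fitted 4-PLR equation reported later for compound 1. The parameter values are hypothetical and `four_pl` is an illustrative name, not code from this study.

```python
import numpy as np

def four_pl(x, a, b, c, d):
    """4-PLR response: d + (a - d) / (1 + (x / c) ** b)."""
    x = np.asarray(x, dtype=float)
    return d + (a - d) / (1.0 + (x / c) ** b)

# Hypothetical parameters: minimum a, Hill slope b, inflection point c, maximum d.
a, b, c, d = 5.0, 1.2, 100.0, 95.0

at_zero = float(four_pl(0.0, a, b, c, d))       # equals a, the minimum response
at_inflection = float(four_pl(c, a, b, c, d))   # equals (a + d) / 2, the median response
print(at_zero, at_inflection)
```

With b > 0 and a < d, the curve rises monotonically from a toward d, and the response at x = c sits exactly halfway between the two asymptotes.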


2. LITERATURE REVIEW 
2.1. Previous research on Alzheimer’s disease 

Many statistical methods have been used in AD research. An analytic method using non-parametric smoothing showed that the scores from several measures can be combined and related to a time-index estimate of dementia severity based on the disease time course or duration [8]. The mini-mental state examination (MMSE) has been used extensively to measure the severity of dementia in AD. The authors of [9] proposed a method based on a "time-index", modeled the change in the MMSE as a function of time, and showed that AD progression over time could be modeled by a cubic or a logarithmic function of the MMSE score.

The statistical parametric mapping (SPM) algorithm for diagnosing the early stage of AD was discussed in [10]. A few years later, [11] applied multivariate analysis based on principal component analysis to evaluate the correlation/covariance of activation across brain regions rather than proceeding on a voxel-by-voxel basis. Furthermore, [12] studied edge detection, an important initial step in the image processing of CT scan images similar to those in [10], using the Weibull distribution. The advantage of this approach is that the Weibull distribution can take both symmetric and asymmetric shapes, whereas the Gaussian distribution is limited to a symmetric shape.

A study of the acetylcholinesterase inhibitory activity of some herbal medicines, aimed at a new management approach for AD, applied Student's t-test to compare the mean ± SD of the herbal medicines [13]. In the same year, [14] reported a long-run longitudinal study and meta-analysis examining which broad personality factors and specific personality facets are associated with an increased risk of AD. Multivariate analysis of covariance (MANCOVA) was used by [15] to show that active smoking is related to a significantly increased risk of AD.

Multi-resolution statistical analysis of brain connectivity graphs in preclinical AD showed significant connectivity differences between AD individuals [16]. The proposed algorithm helps identify potentially subtle differences between patient groups. On the other hand, meta-analysis was applied to investigate whether chronic exposure to aluminum is associated with an increased risk of AD [17], with the Q test and I² statistic used to examine the heterogeneity between the selected studies. In the same year, [18] proposed a delayed-start analysis approach, implemented for the first time in the field of AD. They suggested that the solanezumab results from the EXPEDITION program are consistent with a disease-modifying effect in mild AD patients. Recently, Bayesian statistics enhanced the efficiency of analyzing the study data of a disease-modifying drug in AD [19]. In addition, a systematic review and meta-analysis carried out by [20] showed that insomnia is associated with an increased risk of dementia.


2.2. Polynomial regression

A linear regression analysis was used to compute the IC50 for the reduction of prostaglandin E2 content by different test substance concentrations [21]. To compare regression models, the coefficient of determination, R², can be used as a goodness-of-fit measure [22]. In addition, a second-degree polynomial was used to compute the IC50 of a drug by [23]. Meanwhile, [24] proposed a nonparametric model to describe the dose-response relationship and fitted the curve by polynomial regression.

The Shapiro-Wilk and Anderson-Darling tests were found adequate for testing normality in simple linear regression when the sample size equals 20 [25]. When the normality assumption of a simple linear model is not met, a transformation can be applied [26]. For convenience in inference and improved efficiency in estimation, simple linear regression analyses often make additional assumptions beyond normally distributed errors [27].


2.3. Four-parameter logistic regression (4-PLR) 

The 4-PLR model-fitting approach has been applied to assay data whose variance increases along with the mean response [7], illustrated with radioimmunoassay (RIA) and enzyme-linked immunosorbent assay (ELISA) data. The extended 4-PLR model in multianalyte analysis (MELISA) is discussed and compared with a more simplistic approach in ELISA [28]. An add-on package for the R language that simultaneously fits several non-linear regression models, with a focus on analyzing dose-response curves, has been introduced [29]. Its functionality applies to arbitrary non-linear regression models such as the 4-PLR model.

A comparative fit of several non-linear models, including the 4-PLR model, was carried out on bone marrow endothelial cell lines replicated at various doses of an inhibitory agent [30]. Meanwhile, [31] developed an EM algorithm suited to maximum likelihood estimation under the 4-PLR model after conceptualizing the problem as a mixture of subpopulations: some subjects respond regardless of dose, some fail to respond regardless of dose, and some respond with a probability that depends on the dose. The 4-PLR (Hill equation) model was used to derive IC50 values in the HDAC4 and ENPP2 high-throughput screening data sets, and the results were compared with a series of numerical simulations [32].

Moreover, [33] showed that several approaches, including the 4-PLR model, used to quantify the inhibition of drug efflux in a cell assay can result in different IC50 values. In the same year, [34] presented a Solver-based Microsoft Excel template and a stand-alone GUI-based "point and click" program called HEPB to fit the Hill equation. The Hill equation parameter estimates from both programs were identical to those from GraphPad Prism and the programming language R. Even when the actual curve is asymmetric, [6] showed that the 4-PLR model sometimes offers better performance than the 5PL model because of the bias-variance trade-off. Another approach to dose-response analysis that includes the 4-PLR model has been proposed [29]. Although the 4-PLR model is often used for the dose-response curve, a newer algorithm showed that a portion of cases was better described by a multiphasic model than by the Hill model [35].


3. RESEARCH METHOD 
3.1. Data 

The data were obtained from an experiment that used Ellman's colorimetric method. Some minor calculations were needed to determine the inhibitory activity against acetylcholinesterase (AChE) [36]. Electric eel AChE served as the enzyme source, while acetylthiocholine iodide (ATCI) served as the substrate of the reaction. Meanwhile, 5,5′-dithiobis(2-nitrobenzoic acid) (DTNB) was used to measure the cholinesterase activity.

Sodium phosphate buffer was added to the 96 wells, followed by the synthesized compound solution at different concentrations and the AChE solution. The mixture was then incubated for a period at room temperature. After that, the reaction was initiated by adding DTNB and ATCI. The hydrolysis of the substrate was determined by monitoring the formation of the yellow 5-thio-2-nitrobenzoate anion, produced by the reaction of DTNB with the thiocholine released by enzymatic hydrolysis, at a wavelength of 412 nm using a 96-well plate reader. The absorbance was recorded 10 times at 1-minute intervals. The percentage of inhibition of AChE was determined by (2), where PI is the percentage of inhibition, $b_c$ is the slope of the control (without any drug), and $b_a$ is the slope of the synthesized compound or tacrine.

$$PI = \frac{b_c - b_a}{b_c} \times 100 \qquad (2)$$
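As an illustration of how the percentage of inhibition follows from the kinetic readings, the sketch below estimates each slope by a least-squares straight line through the absorbance readings and then applies PI = (b_c − b_a)/b_c × 100. The least-squares slope is an assumption, since the text does not state how the slopes were obtained. The absorbance values are invented, and `reaction_slope` and `percent_inhibition` are hypothetical names.

```python
import numpy as np

def reaction_slope(absorbance, interval_min=1.0):
    """Slope of absorbance versus time from a least-squares straight line."""
    t = np.arange(len(absorbance)) * interval_min
    slope, _intercept = np.polyfit(t, absorbance, 1)
    return slope

def percent_inhibition(b_c, b_a):
    """PI = (b_c - b_a) / b_c * 100, with b_c the control slope."""
    return (b_c - b_a) / b_c * 100.0

# Hypothetical readings: 10 reads at 1-minute intervals.
control = 0.10 + 0.020 * np.arange(10)   # well without inhibitor
treated = 0.10 + 0.008 * np.arange(10)   # well with a test compound
pi = percent_inhibition(reaction_slope(control), reaction_slope(treated))
print(round(pi, 2))
```

In this made-up case the treated well reacts at 40% of the control rate, so the inhibition is 60%.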


Eighteen synthetic compounds with the same core structure were synthesized. The compounds labeled 8, 9, 11, 15, 16, 17, and 18 did not show any inhibition within the tested concentration range. Hence, their results were discarded. The results from the remaining eleven synthetic compounds were kept for further analysis.


3.2. Methodology 
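The following regression models were fitted to the PI data set to determine the IC50. The fitted models are written as (3)-(6):

$$\hat{y} = b_0 + b_1 x \qquad (3)$$
$$\hat{y} = b_0 + b_1 x + b_2 x^2 \qquad (4)$$
$$\hat{y} = b_0 + b_1 x + b_2 x^2 + b_3 x^3 \qquad (5)$$
$$\hat{y} = \hat{d} + \frac{\hat{a} - \hat{d}}{1 + \left(\frac{x}{\hat{c}}\right)^{\hat{b}}} \qquad (6)$$

where $b_0$, $b_1$, $b_2$, $b_3$, $\hat{a}$, $\hat{b}$, $\hat{c}$, and $\hat{d}$ are the estimated parameters.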

Since several models can be used to fit the PI data set, the adjusted coefficient of determination (R²adj) was used to determine the best-fitting linear regression model among SLR, quadratic regression (QR), and cubic regression (CR) [37]. The R²adj is a modified version of the coefficient of determination (R²) that is adjusted for the number of predictors in the model. It increases when a new term improves the model more than would be expected by chance and decreases when the term improves it less than expected. A high R²adj indicates that the model fits adequately. Hence, the best-fitting linear regression is the one with the highest R²adj.
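The penalty for extra terms can be seen in a short sketch. It assumes the standard definition R²adj = 1 − (1 − R²)(n − 1)/(n − p − 1), where n is the number of observations and p the number of predictors. The paper does not print this formula, so it is stated here as an assumption. The data are invented and `adjusted_r2` is a hypothetical helper.

```python
import numpy as np

def adjusted_r2(y, y_hat, n_predictors):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1)."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    n = y.size
    ss_res = np.sum((y - y_hat) ** 2)        # residual sum of squares
    ss_tot = np.sum((y - y.mean()) ** 2)     # total sum of squares
    r2 = 1.0 - ss_res / ss_tot
    return 1.0 - (1.0 - r2) * (n - 1) / (n - n_predictors - 1)

# Hypothetical dose-response data, invented for illustration only.
x = np.array([25.0, 50.0, 100.0, 200.0, 400.0, 800.0])
y = np.array([20.5, 24.9, 31.2, 45.0, 66.1, 88.0])

for name, degree in [("SLR", 1), ("QR", 2), ("CR", 3)]:
    coefs = np.polyfit(x, y, degree)         # highest power first
    y_hat = np.polyval(coefs, x)
    print(name, round(100 * adjusted_r2(y, y_hat, degree), 1))
```

A higher-degree polynomial never increases the residual sum of squares, so the raw R² cannot fall; the adjusted version can, which is why it is the fairer yardstick among SLR, QR, and CR.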

The R²adj is suitable for comparing linear regressions with different numbers of terms. Unfortunately, it is not appropriate for non-linear models such as the 4-parameter logistic regression [32]. The root mean square error (RMSE) is more appropriate for comparing the goodness of fit between non-linear and linear models [38]. It measures the differences between the predicted values and the observed values. In contrast with R²adj, a lower RMSE indicates a better model fit. The RMSE is written as (7):


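$$\mathrm{RMSE} = \sqrt{\frac{\sum_{i}\left(y_i - \hat{y}_i\right)^2}{df_e}} \qquad (7)$$

where $\hat{y}_i$ is the ith fitted response, $y_i$ is the ith observed response, and $df_e$ is the degrees of freedom of the error term.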
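A minimal sketch of this calculation follows. It assumes the error degrees of freedom equal the number of observations minus the number of estimated parameters (for example 3 for QR and 4 for 4-PLR), which is the usual convention but is not spelled out in the text. The data are invented and `rmse` is a hypothetical helper.

```python
import numpy as np

def rmse(y, y_hat, n_params):
    """Root mean square error using df_e = n - n_params in the denominator."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    df_error = y.size - n_params
    return float(np.sqrt(np.sum((y - y_hat) ** 2) / df_error))

# Hypothetical dose-response data, invented for illustration only.
x = np.array([25.0, 50.0, 100.0, 200.0, 400.0, 800.0])
y = np.array([20.5, 24.9, 31.2, 45.0, 66.1, 88.0])

coefs = np.polyfit(x, y, 2)                       # quadratic regression
qr_rmse = rmse(y, np.polyval(coefs, x), n_params=3)
print(round(qr_rmse, 3))
```

Because the same function accepts fitted values from any model, it can place a polynomial fit and a 4-PLR fit on a common scale.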
Once the best model was identified, the estimated equation was formed with its estimated coefficients. The response variable y was then set to 50% and the equation was solved for x. The resulting value of x is the IC50 in micrograms per milliliter (µg/mL). At y equal to 50%, the QR and CR curves can have more than one value of x. Hence, the lowest positive root was chosen as the IC50 and the others were discarded [23].
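The root-selection rule can be sketched as follows, using the coefficients reported for compound 1 (the quadratic for n = 1 and the straight line for n = 3). The helper name `ic50_from_polynomial` and the use of NumPy's polynomial root finder are assumptions made for illustration, not the software used in the study.

```python
import numpy as np

def ic50_from_polynomial(coefs, target=50.0):
    """Smallest positive real x with b0 + b1*x + ... + bn*x**n = target.

    coefs is ordered b0, b1, ..., bn. Returns None if no such root exists.
    """
    shifted = np.array(coefs, dtype=float)
    shifted[0] -= target                     # solve p(x) - target = 0
    roots = np.roots(shifted[::-1])          # np.roots expects highest power first
    real = roots[np.abs(roots.imag) < 1e-9].real
    positive = real[real > 0]
    return float(positive.min()) if positive.size else None

qr_ic50 = ic50_from_polynomial([17.01, 0.1545, -0.000082])   # compound 1, n = 1
slr_ic50 = ic50_from_polynomial([30.10, 0.07475])            # compound 1, n = 3
print(round(qr_ic50, 2), round(slr_ic50, 2))
```

For the quadratic, the second root lies far above the tested concentrations, which is why only the smallest positive root is kept.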


4. RESULTS AND DISCUSSION 
4.1. Comparing the linear models 

Table 1 displays the results of fitting the linear regressions for compound 1 at each replication. For the first replication, the fitted SLR equation was ŷ = 21.03 + 0.09339x with R²adj = 91.8%. It indicated that 91.8% of the variation in the inhibition by compound 1 could be explained by the concentration used. This result is in line with the research conducted by [39], who found that acetylcholinesterase is an essential enzyme to inhibit in Alzheimer's disease. Moreover, [40] and [41] stated that inhibitor concentration significantly influences activity inhibition.

The R²adj then increased to 94.3 and 94.2 when QR and CR were fitted to the data, respectively. The increase in R²adj suggested that QR and CR were better than SLR. Since the R²adj values of QR and CR differed only slightly, QR was selected as the best-fitting model because of the simplicity of its lower polynomial degree. As a result, the selected model for the first replication was ŷ = 17.01 + 0.1545x − 0.000082x².

In the second experiment run, the R²adj values of all three fitted models were lower than those of the first run, but the pattern was similar. CR and QR fitted better than SLR, and QR and CR were very close to each other. QR was again selected as the best model because of its simplicity. For n = 3, the R²adj values of the three fitted models differed from one another, and the best-fitting model was SLR, with the highest R²adj of 90.8 and the fitted equation ŷ = 30.10 + 0.07475x. In summary, the selected models for compound 1 at each replication are listed in Table 2, together with the IC50 calculated from the corresponding selected regression.

Table 2 displays the IC50 of compound 1 for each experimental run. The IC50 for the first experiment run was 245.52 µg/mL, which indicated that 245.52 µg/mL of compound 1 was needed to achieve 50% inhibition. For n = 2 and n = 3, the amounts needed for 50% inhibition were 216.61 µg/mL and 266.22 µg/mL, respectively. The IC50 values obtained at each replication differed slightly from one another, but they represent the same inhibitory ability of the compound. Hence, the mean IC50 of each compound is more informative. The mean of these IC50 values was 242.78 µg/mL. Overall, compound 1 needed an average of 242.78 µg/mL to achieve 50% inhibition [5] of electric eel acetylcholinesterase under Ellman's assay [36]. The rest of the synthetic compounds were analyzed using the same methods, and the resulting IC50 values are shown in Table 3. Overall, the lowest mean IC50 was exhibited by compound 5 (31.84 µg/mL), indicating the strongest inhibition among the compounds, followed by compounds 6 (50.82 µg/mL) and 10 (57.15 µg/mL). However, the mean IC50 values obtained for these compounds showed that their inhibition effect was moderate compared with the common standard drug tacrine used in Ellman's assay [42].
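As a check on the arithmetic, the IC50 for the first run of compound 1 can be reproduced by hand from the selected quadratic model, and the mean follows from the three reported values. The working below is a sketch using the rounded coefficients printed in the paper, so the last digit may differ slightly from the reported figures.

```latex
\begin{aligned}
17.01 + 0.1545x - 0.000082x^{2} &= 50
  \;\Longrightarrow\; 0.000082x^{2} - 0.1545x + 32.99 = 0,\\
x &= \frac{0.1545 \pm \sqrt{0.1545^{2} - 4(0.000082)(32.99)}}{2(0.000082)}
   = \frac{0.1545 \pm 0.11423}{0.000164},\\
x &\approx 245.5 \quad\text{or}\quad x \approx 1638.6,\\
\overline{\mathrm{IC}_{50}} &= \frac{245.52 + 216.61 + 266.22}{3} \approx 242.78 .
\end{aligned}
```

The smaller positive root, about 245.5 µg/mL, is the one retained as the IC50; the larger root lies where the fitted parabola has already turned downward.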


Table 1. Fitted simple linear, quadratic, and cubic regressions for compound 1 at each replication

| Experiment run, n | Type of regression | R²adj | Fitted equation |
|---|---|---|---|
| 1 | Simple linear regression | 91.8 | ŷ = 21.03 + 0.09339x |
| 1 | Quadratic regression | 94.3 | ŷ = 17.01 + 0.1545x − 0.000082x² |
| 1 | Cubic regression | 94.2 | ŷ = 19.17 + 0.08546x + 0.000162x² − 0.0000002056x³ |
| 2 | Simple linear regression | 79.5 | ŷ = 26.54 + 0.07338x |
| 2 | Quadratic regression | 87.0 | ŷ = 20.89 + 0.1593x − 0.000115x² |
| 2 | Cubic regression | 87.4 | ŷ = 23.83 + 0.06503x + 0.000218x² − 0.0000002806x³ |
| 3 | Simple linear regression | 90.8 | ŷ = 30.10 + 0.07475x |
| 3 | Quadratic regression | 89.0 | ŷ = 29.83 + 0.07892x − 0.000006x² |
| 3 | Cubic regression | 86.7 | ŷ = 30.82 + 0.04721x + 0.000106x² − 0.0000000944x³ |


Table 2. IC50 of compound 1 for each experimental run

| Experiment run, n | Selected fitted linear model | IC50 (µg/mL) |
|---|---|---|
| 1 | ŷ = 17.01 + 0.1545x − 0.000082x² | 245.52 |
| 2 | ŷ = 20.89 + 0.1593x − 0.000115x² | 216.61 |
| 3 | ŷ = 30.10 + 0.07475x | 266.22 |


4.2. Choosing between linear and non-linear models 
As discussed earlier, RMSE rather than R²adj is the better basis for choosing between linear and non-linear models [43]. The estimated coefficients of the 4-PLR for compound 1 are listed in Table 4. For n = 1, the estimated 4-PLR model for compound 1 is written as:

$$\hat{y} = 147.53 + \frac{17.94 - 147.53}{1 + \left(\frac{x}{658.35}\right)^{1.15}}$$

The RMSE of this estimated equation was 8.19. However, among the polynomial regression models of compound 1, the estimated QR had an RMSE of 6.97 for n = 1, which is smaller than the RMSE of the estimated 4-PLR. In contrast with R²adj, a smaller RMSE indicates a better model. For n = 2 and n = 3, the RMSE values of the selected linear models were also lower than those of the fitted 4-PLR. Hence, the final fitted models for compound 1 at n = 2 and n = 3 were QR and SLR, respectively, as shown in Table 4.
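For the 4-PLR, the IC50 has a closed form: rearranging ŷ = d + (a − d)/(1 + (x/c)^b) for x gives x = c·((a − d)/(ŷ − d) − 1)^(1/b). The sketch below applies this to the compound 1 coefficients for n = 1 (a = 17.94, b = 1.15, c = 658.35, d = 147.53, as listed in Table 4). The function names are illustrative. Because the printed coefficients are rounded, the result lands near, but not exactly on, the reported 249.12 µg/mL.

```python
def four_pl(x, a, b, c, d):
    """4-PLR response: d + (a - d) / (1 + (x / c) ** b)."""
    return d + (a - d) / (1.0 + (x / c) ** b)

def ic_from_four_pl(a, b, c, d, target=50.0):
    """Invert the 4-PLR curve: concentration at which the response equals target."""
    ratio = (a - d) / (target - d) - 1.0
    if ratio <= 0:
        raise ValueError("target lies outside the range spanned by a and d")
    return c * ratio ** (1.0 / b)

# Rounded coefficients reported for compound 1, first experiment run.
ic50 = ic_from_four_pl(a=17.94, b=1.15, c=658.35, d=147.53)
print(round(ic50, 1))
```

Substituting the result back into `four_pl` returns 50, which confirms the inversion; the guard on `ratio` covers curves whose asymptotes never reach the target response.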
Table 3. The best fitted linear model and the IC50 for each synthetic compound

| Adamantane | n | R²adj | Best fitted linear model | IC50 | Mean of IC50 |
|---|---|---|---|---|---|
| 1 | 1 | 94.3 | ŷ = 17.01 + 0.1545x − 0.000082x² | 245.52 | 242.78 |
| | 2 | 87.0 | ŷ = 20.89 + 0.1593x − 0.000115x² | 216.61 | |
| | 3 | 90.8 | ŷ = 30.10 + 0.07475x | 266.22 | |
| 2 | 1 | 97.8 | ŷ = 17.38 + 0.4085x − 0.001191x² + 0.000001x³ | 114.27 | 107.49 |
| | 2 | 91.1 | ŷ = 16.84 + 0.4279x − 0.001260x² + 0.000001x³ | 110.03 | |
| | 3 | 97.1 | ŷ = 27.99 + 0.2891x − 0.000759x² + 0.000001x³ | 98.16 | |
| 3 | 1 | 96.6 | ŷ = 13.85 + 0.4535x − 0.001141x² + 0.000001x³ | 104.82 | 104.66 |
| | 2 | 96.5 | ŷ = 15.59 + 0.3420x − 0.000315x² | 112.21 | |
| | 3 | 99.3 | ŷ = 14.49 + 0.4385x − 0.000842x² + 0.000001x³ | 96.95 | |
| 4 | 1 | 95.8 | ŷ = −9.846 + 0.6693x − 0.001040x² | 107.31 | 120.32 |
| | 2 | 74.0 | ŷ = 26.86 + 0.1740x | 132.99 | |
| | 3 | 98.6 | ŷ = 44.82 − 0.4051x + 0.004678x² − 0.000008x³ | 120.68 | |
| 5 | 1 | 91.0 | ŷ = −23.47 + 3.506x − 0.03957x² + 0.000123x³ | 30.40 | 31.84 |
| | 2 | 81.7 | ŷ = −4.07 + 2.738x − 0.02872x² + 0.000086x³ | 26.56 | |
| | 3 | 76.6 | ŷ = 19.26 + 0.9188x − 0.003155x² | 38.56 | |
| 6 | 1 | 97.0 | ŷ = 5.238 + 1.171x − 0.004178x² | 45.67 | 50.82 |
| | 2 | 84.3 | ŷ = 1.681 + 0.9317x − 0.002862x² | 64.73 | |
| | 3 | 77.8 | ŷ = 17.68 + 0.9112x − 0.003392x² | 42.05 | |
| 7 | 1 | 58.3 | ŷ = 29.06 + 0.2315x | 90.45 | 66.65 |
| | 2 | 85.4 | ŷ = 6.284 + 0.9319x − 0.003115x² | 58.25 | |
| | 3 | 76.5 | ŷ = 25.64 + 0.5771x − 0.001984x² | 51.24 | |
| 10 | 1 | 99.7 | ŷ = 24.10 + 0.6236x − 0.002381x² + 0.000003x³ | 50.73 | 57.15 |
| | 2 | 99.4 | ŷ = 21.86 + 0.4421x − 0.000659x² | 71.21 | |
| | 3 | 96.2 | ŷ = 0.596 + 1.289x − 0.006330x² + 0.000009x³ | 49.52 | |
| 12 | 1 | 98.2 | ŷ = 9.625 + 0.1822x − 0.000095x² | 255.68 | 229.07 |
| | 2 | 94.2 | ŷ = 1.537 + 0.3247x − 0.000435x² + 0.0000002051x³ | 195.94 | |
| | 3 | 99.1 | ŷ = 7.370 + 0.2104x − 0.000125x² | 235.59 | |
| 13 | 1 | 96.7 | ŷ = 3.058 + 0.4787x + 0.002700x² − 0.000017x³ | 80.12 | 87.32 |
| | 2 | 89.0 | ŷ = 0.85 + 0.1192x + 0.00932x² − 0.000041x³ | 81.16 | |
| | 3 | 77.6 | ŷ = −5.58 + 0.7470x − 0.001936x² | 100.67 | |
| 14 | 1 | 88.6 | ŷ = −14.22 + 0.6860x − 0.001371x² + 0.000001x³ | 119.79 | 115.73 |
| | 2 | 92.9 | ŷ = −15.93 + 0.6889x − 0.001296x² + 0.000001x³ | 120.47 | |
| | 3 | 95.6 | ŷ = −0.41 + 0.5763x − 0.001088x² + 0.000001x³ | 106.94 | |


Table 4. RMSE of 4-parameter logistic and polynomial regression of compound 1

4-parameter logistic regression

| Experiment run, n | â | b̂ | ĉ | d̂ | RMSE | IC50 (µg/mL) |
|---|---|---|---|---|---|---|
| 1 | 17.94 | 1.15 | 658.35 | 147.53 | 8.19 | 249.12 |
| 2 | 21.00 | 1.07 | 398.84 | 105.77 | 10.72 | 216.74 |
| 3 | 31.56 | 1.65 | 672.69 | 131.29 | 8.54 | 273.00 |

Polynomial regression models

| Experiment run, n | Fitted model | RMSE | IC50 (µg/mL) |
|---|---|---|---|
| 1 | ŷ = 17.01 + 0.1545x − 0.000082x² | 6.97 | 245.52 |
| 2 | ŷ = 20.89 + 0.1593x − 0.000115x² | 8.77 | 216.61 |
| 3 | ŷ = 30.10 + 0.07475x | 7.41 | 266.22 |


The comparison between the polynomial regression models and the fitted 4-PLR non-linear models is summarized as the final fitted model in Table 5. Although the 4-PLR is recommended for fitting the dose-response curve, it is not always the best-fitted model. For instance, the 4-PLR was not the best-fitted model for compound 1. However, for compound 12, the 4-PLR was the best-fitted model for all replications n.

Table 5. Final fitted model, IC50, and the mean of IC50 of each synthetic compound

| Adamantane | n | RMSE | Final model | IC50 | Mean of IC50 |
|---|---|---|---|---|---|
| 1 | 1 | 6.97 | ŷ = 17.01 + 0.1545x − 0.000082x² | 245.52 | 242.78 |
| | 2 | 8.77 | ŷ = 20.89 + 0.1593x − 0.000115x² | 216.61 | |
| | 3 | 7.41 | ŷ = 30.10 + 0.07475x | 266.22 | |
| 2 | 1 | 2.97 | ŷ = 17.38 + 0.4085x − 0.001191x² + 0.000001x³ | 114.27 | 107.49 |
| | 2 | 6.39 | ŷ = 16.84 + 0.4279x − 0.001260x² + 0.000001x³ | 110.03 | |
| | 3 | 2.67 | ŷ = 27.99 + 0.2891x − 0.000759x² + 0.000001x³ | 98.16 | |
| 3 | 1 | 6.76 | ŷ = 13.85 + 0.4535x − 0.001141x² + 0.000001x³ | 104.82 | 98.19 |
| | 2 | 6.49 | ŷ = 139.86 + (11.09 − 139.86)/(1 + (x/ĉ)^b̂) | 97.84 | |
| | 3 | 1.17 | ŷ = 226.16 + (1.10 − 226.16)/(1 + (x/ĉ)^b̂) | 91.90 | |
| 4 | 1 | 4.40 | ŷ = 89.51 + (3.88 − 89.51)/(1 + (x/ĉ)^b̂) | 97.19 | 119.29 |
| | 2 | 14.79 | ŷ = 26.86 + 0.1740x | 132.99 | |
| | 3 | 3.07 | ŷ = 93.41 + (38.13 − 93.41)/(1 + (x/ĉ)^b̂) | 127.68 | |
| 5 | 1 | 9.38 | ŷ = −23.47 + 3.506x − 0.03957x² + 0.000123x³ | 30.40 | 31.84 |
| | 2 | 12.34 | ŷ = −4.07 + 2.738x − 0.02872x² + 0.000086x³ | 26.56 | |
| | 3 | 12.24 | ŷ = 19.26 + 0.9188x − 0.003155x² | 38.56 | |
| 6 | 1 | 4.95 | ŷ = 5.238 + 1.171x − 0.004178x² | 45.67 | 50.82 |
| | 2 | 11.55 | ŷ = 1.681 + 0.9317x − 0.002862x² | 64.73 | |
| | 3 | 10.40 | ŷ = 17.68 + 0.9112x − 0.003392x² | 42.05 | |
| 7 | 1 | 13.59 | ŷ = 29.06 + 0.2315x | 90.45 | 62.69 |
| | 2 | 9.68 | ŷ = 69.65 + (4.36 − 69.65)/(1 + (x/ĉ)^b̂) | 46.39 | |
| | 3 | 7.68 | ŷ = 25.64 + 0.5771x − 0.001984x² | 51.24 | |
| 10 | 1 | 1.33 | ŷ = 211.69 + (4.66 − 211.69)/(1 + (x/ĉ)^b̂) | 48.27 | 56.86 |
| | 2 | 2.14 | ŷ = 21.86 + 0.4421x − 0.000659x² | 71.21 | |
| | 3 | 3.02 | ŷ = 80.10 + (21.11 − 80.10)/(1 + (x/ĉ)^b̂) | 51.09 | |
| 12 | 1 | 4.21 | ŷ = 110.94 + (13.73 − 110.94)/(1 + (x/ĉ)^b̂) | 243.48 | 215.93 |
| | 2 | 8.02 | ŷ = 97.68 + (8.76 − 97.68)/(1 + (x/ĉ)^b̂) | 189.94 | |
| | 3 | 2.42 | ŷ = 106.72 + (8.96 − 106.72)/(1 + (x/ĉ)^b̂) | 214.38 | |
| 13 | 1 | 1.63 | ŷ = 67.19 + (8.10 − 67.19)/(1 + (x/ĉ)^b̂) | 69.76 | 83.86 |
| | 2 | 10.19 | ŷ = 0.85 + 0.1192x + 0.00932x² − 0.000041x³ | 81.16 | |
| | 3 | 13.70 | ŷ = −5.58 + 0.7470x − 0.001936x² | 100.67 | |
| 14 | 1 | 8.30 | ŷ = −14.22 + 0.6860x − 0.001371x² + 0.000001x³ | 119.79 | 115.73 |
| | 2 | 6.67 | ŷ = −15.93 + 0.6889x − 0.001296x² + 0.000001x³ | 120.47 | |
| | 3 | 4.20 | ŷ = −0.41 + 0.5763x − 0.001088x² + 0.000001x³ | 106.94 | |

In the 4-PLR rows, the fitted values of ĉ and b̂ are not legible in the source and are left symbolic.


5. CONCLUSION 

Simple linear regression is the simplest model for fitting the dose-response curve to estimate the IC50. When other polynomial regressions fit better than SLR, the SLR model is often discarded. Non-linear regression such as four-parameter logistic regression (4-PLR) is commonly recommended because it often fits dose-response data more closely than other types of regression. However, based on the RMSE in this research, 4-PLR sometimes did not fit the data closely, and linear or polynomial regressions fitted better than 4-PLR. Based on the RMSE value, the better model for determining the IC50 of a series of synthetic compounds was polynomial regression of degree 2. The estimated regression model is ŷ = 17.01 + 0.1545x − 0.000082x². This model has an RMSE of 6.97. Based on the model, the estimated IC50 was 245.52 µg/mL. Further inspection of the percentage of inhibition (PI) collected for each compound must be carried out. More analysis is needed to obtain better-fitting regression models with a more reliable half-maximal inhibitory concentration (IC50).